Name | Version | Summary | date |
upspawn-ocr-cli |
0.1.0b3 |
Modern, polished CLI to extract text from PDFs using the Mistral OCR API. |
2025-08-15 23:24:29 |
hashub-docapp |
1.0.0 |
Professional Python SDK for the HashubDocApp API - Advanced OCR, document conversion, and text extraction service |
2025-08-15 12:09:58 |
kokoro-tts |
2.2.1 |
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents. |
2025-08-14 22:13:00 |
streamlit-pdf |
1.0.6 |
A Streamlit component for viewing PDF files |
2025-08-14 20:48:20 |
llama-index-packs-resume-screener |
0.9.1 |
llama-index packs resume_screener integration |
2025-08-14 20:17:36 |
bulkinvoicer |
0.1.0.dev1 |
A simple python script to quickly create bulk invoices. |
2025-08-14 18:54:29 |
plutoprint |
0.3.0 |
Paged HTML rendering library |
2025-08-14 12:52:45 |
web2llm |
0.5.1 |
A tool to scrape web content into clean Markdown for LLMs. |
2025-08-14 08:53:14 |
aspose-cells |
25.8.0 |
Aspose.Cells for Python via Java is a high-performance library that unleashes the full potential of Excel in your Python projects. It can be used to efficiently manipulate and convert Excel and spreadsheet formats including XLS, XLSX, XLSB, ODS, CSV, and HTML - all from your Python code. Amazingly, it also offers free support. |
2025-08-14 02:30:43 |
pdfix-sdk |
8.7.3 |
PDFix SDK - Automated PDF Remediation, Data Extraction, HTML Conversion |
2025-08-14 00:23:04 |
llm-markdownify |
0.3.0 |
Convert PDFs, images to high-quality Markdown using Vision LLMs. |
2025-08-13 22:07:30 |
inkognito |
0.1.0 |
Privacy-first document processing FastMCP server with PII anonymization |
2025-08-13 17:45:52 |
lizeur |
0.1.3 |
Lizeur is a MCP server to be able to get content from PDFs. |
2025-08-13 17:21:00 |
aspose-words-cloud |
25.8.0 |
Python Cloud SDK wraps Aspose.Words Cloud API so you could seamlessly integrate Microsoft Word file generation, manipulation, conversion & inspection features into your own python applications. |
2025-08-13 12:45:20 |
surya-ocr |
0.15.4 |
OCR, layout, reading order, and table recognition in 90+ languages |
2025-08-12 23:21:48 |
docling |
2.44.0 |
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. |
2025-08-12 09:52:48 |
pdfkb-mcp |
0.4.1 |
A Model Context Protocol server for managing PDF documents with vector search capabilities |
2025-08-12 04:10:04 |
diffpy.cmi |
0.0.1 |
Complex modeling infrastructure: a modular framework for multi-modal modeling of scientific data. |
2025-08-11 15:54:09 |
ipxact2systemverilog |
1.0.26 |
Generate VHDL, SystemVerilog, html, rst, md, pdf, c headers from an IPXACT description |
2025-08-11 11:20:59 |
docstrange |
1.1.3 |
Extract and Convert PDF, Word, PowerPoint, Excel, images, URLs into multiple formats (Markdown, JSON, CSV, HTML) with intelligent content extraction and advanced OCR. |
2025-08-11 07:10:23 |